Multilingual Data-Driven Pronunciation

Author

  • R. I. Damper
Abstract

Automatic pronunciation of unknown words is a hard problem of great importance in speech technology. The difficulty of the problem appears to vary across languages, according to the ‘depth’ of orthography, although there is, as yet, little by way of quantitative comparison. Early solutions were based on manually written expert rules, but these are expensive to derive and maintain, are entirely language-specific and, according to more recent evaluations using large test sets, do not perform very well (at least for English). By contrast, data-driven methods (which infer pronunciations from a set of examples) require little expert knowledge, can be quickly and easily applied to new languages for which an appropriate set of examples is available, and seem to perform far better than rules. In this paper, we compare the success of data-driven automatic pronunciation for four European languages: English, French, Frisian and German. We also attempt to quantify the difficulty of the problem using the entropy of the association between letters and phonemes found after prior alignment of the various dictionaries.
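The abstract does not say which entropy measure is used. As a minimal sketch of one plausible reading, the conditional entropy H(phoneme | letter) can be estimated from one-to-one aligned letter-phoneme pairs: a value near zero suggests a shallow orthography, a larger value a deeper one. The function name `conditional_entropy` and the toy alignments below are assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter


def conditional_entropy(aligned_pairs):
    """Estimate H(phoneme | letter) in bits from (letter, phoneme) pairs.

    Assumes a prior one-to-one alignment has already been done; this is
    an illustrative measure, not necessarily the one used in the paper.
    """
    pairs = list(aligned_pairs)
    total = len(pairs)
    joint = Counter(pairs)                          # counts of (letter, phoneme)
    letters = Counter(letter for letter, _ in pairs)  # counts of each letter
    entropy = 0.0
    for (letter, _phoneme), count in joint.items():
        p_joint = count / total                     # p(letter, phoneme)
        p_cond = count / letters[letter]            # p(phoneme | letter)
        entropy -= p_joint * math.log2(p_cond)
    return entropy


# Hypothetical toy data: every letter maps to exactly one phoneme.
shallow = [("a", "A"), ("b", "B"), ("a", "A"), ("b", "B")]
# Hypothetical toy data: the letter "a" maps to two different phonemes.
deep = [("a", "A"), ("a", "E"), ("b", "B"), ("b", "B")]

print(conditional_entropy(shallow))  # 0.0
print(conditional_entropy(deep))     # 0.5
```

With real dictionaries, the pairs would come from the aligned entries of each language, so the resulting figures could be compared across languages.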


Similar resources

Multilingual Pronunciat Improving Multilingual S

Multilinguality aspects are becoming increasingly important in Automatic Speech Recognition (ASR) systems. It is apparent that coping with the large variability of the speech signal is an even bigger challenge in multilingual ASR systems than it has been in conventional monolingual systems. In this paper, we address the importance of combining multilingual pronunciation modeling and acoustic mo...


Pronunciation and Acoustic Model Adaptation for Improving Multilingual Speech Recognition

In this paper, we address the importance of pronunciation and acoustic model adaptation in multilingual speech recognition. When aiming at modeling several languages simultaneously, the degree of speaker and language variability is even greater than when concentrating on only one language. To compensate for the pronunciation variability across various speakers, bi-lingual pronunciation modeling is p...


GlobalPhone: Pronunciation Dictionaries in 20 Languages

This paper describes the advances in the multilingual text and speech database GLOBALPHONE, a multilingual database of high-quality read speech with corresponding transcriptions and pronunciation dictionaries in 20 languages. GLOBALPHONE was designed to be uniform across languages with respect to the amount of data, speech quality, the collection scenario, the transcription and phone set convent...


Towards multilingual interoperability in automatic speech recognition

In this communication, we address multilingual interoperability aspects in speech recognition. After giving a tentative definition of multilingual interoperability, we discuss speech recognition components and their language-specific aspects. We give a sample overview of past multilingual speech recognition research and development across different speaking styles (read, prepared and conversati...


Multilingual phone clustering for recognition of spontaneous Indonesian speech utilising pronunciation modelling techniques

In this paper, a multilingual acoustic model set derived from English, Hindi, and Spanish is utilised to recognise speech in Indonesian. In order to achieve this task, we incorporate a two-tiered approach to perform the cross-lingual porting of the multilingual models to a new language. In the first stage, we use an entropy-based decision tree to merge similar phones from different languages int...



Publication date: 2009